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1. INTRODUCTION 



1.1. Elliptic curves 



P~^ ' An elliptic curve E over Q is the projective closure of a curve y"^ = x^ + Ax + B for 

^^. • some fixed A, B E Q satisfying AA^ + 275^ 7^ (the inequality is the condition for the 

"^ ■ curve to be smooth). Such curves are interesting because 

a ' 

1. they are the simplest algebraic varieties whose rational points are not completely 

understood, and 

2. they are the simplest examples of projective algebraic groups of positive dimension. 
> 



^ I The abelian group -^(Q) of rational points on E is finitely generated |Mor22| . Hence 



00 ! E{Q) ^ Z*" © T for some nonnegative integer r (the rank) and some finite abelian 



group T (the torsion subgroup). The torsion subgroup is well understood, thanks to 
B. Mazur |Maz77] . but the rank remains a mystery. Already in 1901, H. Poincare |Poi01t 



p 

cn 

o 

CN ' p. 173] asked what is the range of possibilities for the minimum number of generators 

of E{Q), but it is not known even whether r is bounded. There are algorithms that 
compute r successfully in practice, given integers A and B of moderate size, but to 

r> ' know that the algorithms terminate in general, it seems that one needs a conjecture: 

C^ . either the finiteness of the Shafarevich-Tate group III (or of its p-primary part for some 

prime p), or the Birch and Swinnerton-Dyer conjecture that r equals the analytic rank 
Tan := ord,=i L{E, s) |BSD65j . 



The main results of Bhargava and Shankar (Section II. 4p concern the average value 
of r as ii^ ranges over all elliptic curves over Q. 

1.2. Selmer groups 

There is essentially only one known proof that E{Q) is finitely generated. The hardest 
step involves proving the finiteness of E{Q)/nE{Q) for some n > 2. This is done by 
embedding E{Q)/nE{Q) into the n-Selmer group Sel„(-E'), which we now define. 
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For each prime p, let Qp be the field of p-adic numbers; also define Qoo := M- Let Q 
be an algebraic closure of Q. We write H^(Q, E'), for example, to denote the profinite 
group cohomology H^(Gal(Q/Q), E(Q)). 

Fix n > 2. For any abelian group or group scheme G, let G[n\ be the kernel of 
multiplication-by-n on G. Taking cohomology of 

— > E[n] — > E ^ E^O 

over Q and Qp leads to the exact rows in the commutative diagram 

" nE{Q) 



4tSv Hi(Q. ^M) Hi(Q, E)[n] 



(1) 



E 



/3 



0— ^ n 5v^ -^ n H^(Q.'^N) ^ n ^'(q^^em^o 



p<oo ^ '^^ p<oo p<oo 



The group H^(Q,£'[n]) turns out to be infinite, and it is difficult to determine which of 
its elements are in the image of E{Q)/nE{Q). But because arithmetic over Qp is easier 
than arithmetic over Q, one can determine which elements are locally in the image. 
With this in mind, define 

Sel„,(E) := {x G E\Q,E[n]) : /3{x) G image(a)} 

Diagram ([1]) shows that the subgroup Sel„(£') C H^(Q, E[n]) is an upper bound for the 
image of E(Q)/nE{Q). In fact, if we define also the Shafarevich— Tate group 

m = m{E) :=ker Ir\Q,E) ^ J] R\Qp, E)] , 

\ p<oo / 

then diagram ([1]) yields an exact sequence 



(2) ^ J^ -^ SeUE) -. m[n] -. 0. 



Moreover, it turns out that Sel.„(-E) is finite and computable. 

1.3. Averaging over all elliptic curves 

The average of an infinite sequence of real numbers ai,a2,... is defined as 
Imin^ao^ai + ■ ■ ■ + an)/n, if the limit exists. This may depend on the ordering of 
the terms. Hence, to define the average rank of elliptic curves, we should first decide 
how to order them. 

Tables such as |BK75t [Cre97t ICrej ISte) order elliptic curves by their conductor A^. 
But it is not known even how many elliptic curves have conductor < X asymptotically 
as X — )■ oo, so we cannot hope to prove anything nontrivial about averages for this 
ordering. Ordering by minimal discriminant runs into the same difficulty. 
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Therefore we order by height, which we now define. Elhptic curves y'^ = x^ + Ax + B 
and if^ = x^ + A'x + B' over Q are isomorphic if and only if there exists g G Q'^ 
such that [A'^B') = {q'^A,q^B). Therefore each isomorphism class contains a unique 
representative Eab with {A,B) G Z^ minimal in the sense that there is no prime p 
with p^\A and p^\B. Let S' be the set of all such Eab- Define the (naive) height 
H{Eab) = H{A,B) := max{\AA^\,27B^}. (Other authors replace 4 and 27 by other 
positive constants; it is only the exponents that matter in the proofs.) For X G M, 
define S'^x := [E e S : H{E) < X}. For any 0: S" -^R, define its average by 



Average (0) := lim 



^^°° Eeg,?^^ 1 



if the limit exists. Define Average(0) and Average(</)) similarly, but using lim sup or 
liminf, respectively. 

We may speak also of the probability or density of the set of elliptic curves satisfying 
a given property. Namely, the property P can be identified with its characteristic func- 
tion xp: ^ — > {0,1}; then define Prob(P) = Average(xp)- Similarly define Prob(P) 
and Prob(P). 

Example 1.1. — B. Mazur's theorem |Maz77| bounds the possibilities for the torsion 
subgroup T. The Hilbert irreducibility theorem shows that each nonzero possibility for 
T occurs rarely. Together, they show that Prob(T ^ 0) is 0. 

1.4. Main results of Bhargava and Shankar 

Theorem 1.2 f IBSlOal Theorem 1.1]). — Average(#Sel2) = 3. 

If one averages not over all of (f , but over a subset defined by finitely many congruence 
conditions on A and B (e.g., A = 5 (mod 7) and P = 3 (mod 4)), then the average 
is still 3 |BS10a| Theorem 1.3]. This is interesting, given that one of the successful 
techniques for constructing elliptic curves of moderately large rank has been to restrict 
attention to congruence classes so as to maximize ^E{¥p) for the first few primes p 
|Mes82j . 



A similar argument leads to 



Theorem 1.3 f jBSlObl Theorem 1]). — Average (# Selg) < 4. 

Again one can obtain the same bound for elliptic curves satisfying finitely many congru- 
ence conditions. One can even impose congruence conditions at infinitely many primes 
as long as one can show that the conditions at large primes together are sieving out a 
negligible subset. 

It is still not known whether Average(r) exists, but Theorems II .21 and II .3l yield upper 
bounds on Average (r): 



Corollary 1.4 r |BS10bl Corollary 2]). — Average(r) < 7/6. 
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Proof. — Let s = dimSela. The injection E{Q)/3E{Q) --^ Seh^E) yields r < s. 
Combining this with 6s — 3 < 3* bounds r in terms of # Sels; then apply Average and 
use Theorem 11.31 (Why 6s — 3? Since 3* is a convex function, it suffices to connect the 
points (s, 3^*) for s = 0, 1, ... in order by line segments, and to take the equation of the 
line segment that crosses the horizontal line y = 4.) D 

Further consequences of Theorem 11.31 make use of results of Dokchitser-Dokchitser 
and Skinner-Urban, whose context can be best understood if we introduce a few more 
quantities. Taking the direct limit of ([2]) as ra ranges through powers of a prime p yields 
the p°°-Selmer group Se\poo[E) fitting in an exact sequence 

(3) -^ E{Q) ® ^ ^ Selpoo (E) -^ m[p~] -^ 0. 

£jp 

Each term in ([3]) has the form (Qp/Zp)^ © (finite) for some nonnegative integer c called 
the corank. Let rp^ := corankSelpoo(£'). Let s'p := dimSelp(£') — dim. E[p]{Q) . If LQ' is 
the quotient of LQ by its maximal divisible subgroup, then s'— rpoo = dimLQ'[p], which 
is even since ni'[p°°] is a finite group with a nondegenerate alternating pairing |Cas62] . 
By ([3]), Tpoo — r = corank LE [p°°] , which is if and only if L[I[p°°] is finite. To summarize, 

rA\ / _ ST BSD 

(,4j Sp = Tpoo — r — Tan, 

where the congruence is modulo 2, and the equalities labeled with the initials of 
Shafarevich-Tate and Birch-Swinnerton-Dyer are conjectural. Also, 

(5) dimSelp > s^ > Vpoo > r. 

In the direction of the conjectural equality Tpoo = ran, we have two recent theorems: 

Theorem 1.5 ( |DD10i Theorem 1.4]). — For every elliptic curve E overQ, we have 

Tpoo = Tan (mod 2). 

The root number w G {±1} of an elliptic curve E over Q may be defined as the sign 
of the functional equation for the L-function L{E, s), so the conclusion of Theorem 11.51 
may also be written (— 1)*^ = w. 

Theorem 1.6 ( |SU10t Theorem 2(b)]). — For any odd prime p and elliptic curve E 
over Q satisfying mild technical hypotheses, if rpoa = 0, then ran = 0. 



Combining Theorems 11.31 (with congruence conditions). II. 5[ and 11.61 leads to 

Theorem 1.7 r |BS10b[ §4.1,4.2]). — 

(a) Prob fdim Sels = Sg = r^oo = r = r^n = 0) is positive. 

(b) Prob (dim Sels = Sg = r^oo = 1) is positive. 
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Sketch of proof. — First one constructs a positive-density subset ^ C <f such that 
whenever E G ^, its (— l)-twist is in ^ and has the opposite root number. By 
Example II. ![ one can assume that i?(Q)[3] = for every E G <^. Then, by the 
sentence after Theorem II. 5[ the parity of dim Sels = Sg for E' G ^^ is equidistributed. 
Moreover, ^ can be chosen so that Theorem 11.61 apphes to every E G ^, and so that 
the conclusion of Theorem 11.31 holds for ^. For large X, let po be the proportion of 
curves in ^ with dim Sels = 0; define pi similarly. Our bound on the average of ^ Sels 
yields 

Po-l+Pi-S+Q-po) ■9+Q-P1V27 < 4 + 0(1). 

This, with po,pi < 1/2 + o(l), implies pq > 1/4 — o(l) and pi > 5/12 — o(l). This 
proves the bounds for dim Sels = Sg. For E G t^, if S3 = 0, then rsoo = r = r^n = too 
by ([5]) and Theorem 11.61 If S3 = 1, then r^oc = 1 too since Sg — r^oo G 2Z>o. D 

Theorem 1 1.7( lai) implies in particular that a positive proportion of elliptic curves over 
Q have both rank and analytic rank and hence satisfy the Birch and Swinnerton- 
Dyer conjecture that r = Tan- Theorem I1.7l1 b|) implies a conditional statement for 
rank 1: 

Corollary 1.8 ( [BSlObj §4.1]). — IfUI{E) (or at least its 3-primary part) is finite 
for every elliptic curve E over Q, then Prob(r = 1) is positive. 



2. PREVIOUS WORK OF OTHER AUTHORS 

This section exists only to put the theorems above in context. Readers impatient to 
understand the proof of Theorem 11.21 may jump to Section 121 

2.1. Average analytic rank 

Using analogues of Weil's "explicit formula", many authors have given conditional 
bounds on the average analytic rank, both for the family of quadratic twists of a fixed 
elliptic curve over Q, and for the family ^ of all elliptic curves over Q. All these 
analytic results over Q are conditional on the Riemann hypothesis for the L-functions 
of the elliptic curves involved. At the time that some of these results were proved, the 
assertion that the L-function admits an analytic continuation to C was an assumption 
too, but today this is a consequence of the theorem that all elliptic curves over Q are 
modular |BCDT01| . 



D. Goldfeld |Gol79] proved the conditional bound Average(ran) < 3.25 for the fam- 
ily of quadratic twists of a fixed elliptic curve E over Q, and conjectured that the 
correct constant was 1/2. The constant 3.25 was later improved to 1.5 by D. R. Heath- 
Brown |HB04t Theorem 3]. 

For the family of all elliptic curves over Q, A. Brumer proved the conditional bound 
Average(ran) < 2.3 |Bru92] . He also proved the same bound for elliptic curves over ¥g{t) 
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unconditionally. In the case of ¥q(t), the inequahty r < ran is known, so one deduces 
Average(r) < 2.3 in this setting. Over Q, the constant 2.3 was improved to 2 by Heath- 
Brown |HB04t Theorem 1] and to 25/14 by M. Young |You06) . The latter imphed the 
(conditional) positivity of Prob(ran < 1), and then also of Prob er = ran < 1), because 
ran < 1 implies r = ran 0Kol88[lKd90l[CTZ86] with |BFH90| or |MM91| ;i. 

Conditional bounds on Average(ran) for other algebraic families of elliptic curves 
and abelian varieties have been given by E. Fouvry and J. Pomykala |FP93| . 
P. Michel |Mic95[lMK^ . J. Silverman [Sil98] . and R. Wazir |Waz04| . 



2.2. Distribution of Selmer groups 

For the family of elliptic curves y'^ = x^ + k over Q, E. Fouvry proved Average (3*"/^) < 
oo, by bounding the average size of the Selmer group associated to a 3-isogeny (a slight 
generalization of the Selmer groups we have considered so far) |Fou93) . This implies 



that Average (r) < oo in this family. 

Recall our notation s' := dimSelp(-E) — dimii^(Q)[p]. For the family of quadratic 
twists oi y'^ = x^ — X over Q, Heath-Brown proved not only that Average(s2) = 3 but 
also that 

(6) Prob (4 =d)= ( n(i + 2"^rM ( n ^ 

\j>0 / \i=i 



for each d G Z>o |HB93[[HB94] . P. Swinnerton-Dyer |SD08] and D. Kane |Kanll] 



generalized this by obtaining the same distribution for the family of quadratic twists 
of any E over Q with £^[2] C E{Q) but no rational cyclic 4-isogeny. Heath-Brown's 
approach was used also by G. Yu |Yu06| to prove finiteness of Average (# Sel2) for the 
family of all elliptic curves with E[2] C E{Q). In certain subfamilies of this, surprises 
occur: see |Yu05| . 

A probabilistic model predicting the distribution of s'^ for any prime p was proposed 
in |PR12] : for p = 2 the prediction is consistent with Q. 

Earlier, C Delaunay |Del01) proposed a heuristic for the distribution of #111, in 
analogy with the Cohen-Lenstra heuristics |CL84 . 



Finally, there is a conjecture that elliptic curves tend to have the smallest rank 
compatible with the root number, which is expected to be equidistributed. This was 
proposed in |Gol79| for the case of quadratic twists of a fixed curve, but it is probably 
true more generally. In other words, it is expected that Prob(r = 0) and Prob(r = 1) 
are both 1/2. See also |KS99b[ §5] and |KS99a| . 

The three predictions above are compatible with the equation s' = dim III [p] -|- r 
arising from (j2]): see |PR12t §5]. 

2.3. Average size of Selmer groups over function fields 

The closest parallel to the work of Bhargava and Shankar is a 2002 article by A. J. de 
Jong proving the analogue of Theorem 11.31 for function fields, with a slightly weaker 
bound |dJ02] . Namely, for any finite field Fg of characteristic not 3, de Jong proved 
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Average(7^ Sels) < 4 + e{q) for the family of all elliptic curves over Fq(t), where e{q) is 
an explicit rational function of q tending to as g — )■ oo. This implies a corresponding 
bound for Average (r) for such Fq(t). Moreover, de Jong gave heuristics that in hindsight 
hint that Average(# Sela) = 4 not only for Fg(t) but also for Q. 

The approaches of de Jong and Bhargava-Shankar are similar. Namely, both count 
integral models of geometric objects representing elements of Sel„(i?). (For 7i = 3, these 
objects are plane cubic curves.) But the more delicate estimates, essential for obtaining 
an asymptotically sharp upper bound on YIeg,^ - "^ ^^^2 or 'iiE) and a matching lower 
bound (for Sel2), are unique to Bhargava-Shankar. 



3. BACKGROUND: n-DIAGRAMS AND BINARY QUARTIC FORMS 

Each element of H^(Q, ii^[n]) has a geometric avatar, called an n-diagram. To count 
Selmer group elements, we will count the possibilities for the coefficients of the poly- 
nomial equations defining their avatars. We follow |CFO"'"08| §1.3] in Sections 13. 1H3.3[ 



and |BSD63) in Section 13.51 The goal of this section is ([10 

3.1. Diagrams 

Fix a field /c, a separable closure A;, and an elliptic curve E over k. A diagram for 
E' is a morphism of varieties from an E'-torsor C to a variety S. An isomorphism of 
diagrams is given by an isomorphism of E'-torsors C ^ C and an isomorphism of 
varieties S* — ?■ 5" making the obvious square commute. 

3.2. n-diagrams 

Let O G E{k) be the identity. Fix an integer n > 2 with charfc \ n. The trivial 
n-diagram is the diagram E — )■ P"-~i determined by the linear system |nO|, where E 
is viewed as trivial E'-torsor. More generally, an n-diagram is a twist C — )■ S* of the 
trivial n-diagram, i.e., a diagram that becomes isomorphic to the trivial n-diagram after 
base extension of both to k. In particular, S must be a Brauer-Severi variety, a twist 
of projective space. (For this reason, n-diagrams are called Brauer-Severi diagrams 
in 



CFO+08[ §1.3].) 



The automorphism group of the trivial n-diagram over k is given by E[n] acting as 
translations on E and acting compatibly on P"~^. Galois descent theory then yields a 
bijection 
(7) {n-diagrams for E} ^^,^^^^^^^^_ 

isomorphism 

Remark 3.1. — Elements of H^(/c,£'[n]) are in bijection also with geometric objects 
called n-coverings |CFO"'"08[ §1.2]. But it is the n-diagrams that are easiest to count. 



Remark 3.2. — The action of E[n] on P" ^ is given by an injective homomorphism 
E[n] -» PGL„. 
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3.3. Solvable and locally solvable n-diagrams 

The homomorphism B.^{k,E[n]) — > }i^{k,E) corresponds to sending an n-diagram 
C ^ 5* to the torsor C. Its kernel, which is isomorphic to E{k)/nE{k) (cf. ([T])), corre- 
sponds to the set of n-diagrams C — )■ S* for which C has a /c-point; such n-diagrams are 
called solvable: 

{solvable n-diagranis for E} E{k) 



isomorphism nEykj 

An n-diagram C — t- S over Q is locally solvable if C has a Qp-point for all p < oo. In 
this case, 5" too has a Qp-point for all p < oo, and hence S ^ P"~^, by the local-global 
principle for the Brauer group. (Not every n-diagram with S ~ p"-i is locally solvable, 
however.) By ([1]), we have a bijection 

. , {locally solvable n-diagrams for E} n i / t-in 

(9j < — > beln{E). 

isomorphism 

3.4. Binary quartic forms 

With an eye towards Section [3.5^ we consider the space Q[x,|/]4 of binary quartic 
forms. There is a left action of GL2(Q) on Q[x,2/]4 given by (7 ■ f){x,y) := f{{x,y)^) 
(we view {x,y) as a row vector). This induces an action of GL2(Q) on the algebra 
Q[a, b, c, d, e] of polynomial functions in the coefficients of / := ax* + bx^y + cx^y"^ + 
dxy^ + ey*. The subalgebra of SLs-invariants is Q[a, fe, c, d, e]^^^ = q[/^ j] = Q[yi^ 5]^ 
where 

/ := 12ae - ?,bd + c^ A:= -1/3 

J := 72ace + 9bcd - 27ad^ - 2766^ - 2c^ B := -J/27. 

(Why —1/3 and —1/27? To make the Jacobian statement in Section 13.51 true.) A 
quartic form is separable if and only if its discriminant A := —{4:A^+27B'^) is nonzero. If 
7 e GL2(Q) and / E Q[x,yU, then ^(7-/) = (det7)M(/) and 5(7-/) = (det7)'5(/). 
Thus the twisted GL2-action 'j* f := (det 7) "^(7 ■ /) induces a PGL2-action preserving 
A and B. 

3.5. Locally solvable 2-diagrams and binary quartic forms 

Let / G Q[a:, I/J4. Let C be the curve z"^ = f{x,y) in the weighted projective plane 
P(l, 1, 2). Let JacC be its Jacobian. Then there is an isomorphism between JacC and 
the elliptic curve y"^ = x^ + A{f)x+B{f), and the isomorphism depends algebraically on 



the coefficients of /. In fact, this makes C —4 P^ a 2-diagram. There is an approximate 
converse: any locally solvable 2-diagram for i^^ is a degree 2 morphism C — ?> P^ equipped 
with an isomorphism JacC — ?> E such that C has a Qp-point for all p < 00; such a 
curve C is given by z"^ = f{x,y) in P(l, 1,2), for some f{x,y) G Q[x,|/]4. 

Two locally solvable 2-diagrams are isomorphic if and only if the associated mor- 
phisms C — > P^ and C — > P^ are isomorphic forgetting the torsor structures (if there 
exists an isomorphism of such morphisms, there is also an isomorphism respecting the 



1049-09 



torsor structures, because the automorphism group of E over Q is never larger than 
{±1}). And two such morphisms are isomorphic if and only if corresponding quar- 
tic forms are PGL2(Q)-equivalent after multiplying one by an element of Q^^. If the 
quartic forms already have the same invariants, then the element of Q^^ is unnecessary. 
Let V = SpecQ[a,b,c,d,e] be the moduli space of quartic forms, and let S = 
SpecQ[A,B]. Let V G V and iS' C 5 be the open subvarieties defined by A 7^ 0. 
There is a morphism V ^ S taking a quartic form to its pair of invariants. Let Vab be 
the fiber above {A, B) e S'{Q). Define the locally solvable subset V(Q)'' C V'{Q) as 
the set of / G Q[x, y]^ with A 7^ such that z"^ = /(x, y) has a Qp-point for all p < 00. 
Define VABiQY^ similarly. For each {A,B) G iS'(Q), combining the previous paragraph 
with ([9]) for n = 2 and E = Eab yields a bijection 

(10) PGUmV^^iQr ^^ SehiEAs). 



Which quartic forms on the left side correspond to the identity in Sel2(-E'A_B)? Those 
with a linear factor over Q. 

Remark 3.3. — Similarly, SbIsIEab) can be related to PGL3(Q)-orbits of ternary cubic 
forms; this is what is used to prove Theorem 11.31 



4. PROOF OF THE THEOREM ON 2-SELMER GROUPS 

Our goal is to sketch the proof of Theorem 11.21 For lack of space, our presentation 
will necessarily omit many details, so the actual proof is more difficult than we might 
make it seem. Also, in contrast to |BS10a] . we will phrase the proof in adelic terms. 



This has advantages and disadvantages as far as exposition is concerned, but does not 
really change any of the key arguments. 

4.1. Strategy of the proof 

For X eR, define 

5(Z)<x := {{A,B) eZ^ : Aj^ 0, {A, B) is minimal and H{A, B) < X] 
^(Q)<x — the subset of V(Q)''' mapping into 5(Z)<x. 
Summing the sizes of the sets in (ITO|) over (A, B) G iS(Z)<x yields 

(12) #(PGL2(Q)\^(Q)<^)= E *^ME). 

From now on, we forget about Selmer groups and estimate the left side of f lT2|) . 

If our job were to estimate the number of integral points in a region Vt C M", we 
would compute the volume of VL and argue that it is a good estimate provided that the 
shape of VL is reasonable. But according to flT2|) . we need to count (orbits of) rational 
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points. So instead of viewing Z as a lattice in M, we view Q as a lattice in the ring of 
adeles 

A := < (xp) E 1 I Qp : Xp G Zp for all but finitely many p > . 

I. p<oo J 

How do we define an adelic region V{Ay^^ whose set of rational points is V^(Q)<j5(^? 
Inspired by (ITT]) , we define 

5(M)<x := { {A, B) e S\R) ■ H{A, B) < X } 

5(Zp)'^'° := ((Zp X Zp) - (/Zp X /Zp)) - (zeros of A) 

cS(A)<x:=5(M)<xX H S{Z, 






finite p 



{ / G V{Qp) : A ^ and 2^ = /(x, y) has a Qp-point } 
ViAf' := V{A) n Yl 



tp) 

p<OD 



V{Ay^x '■= the subset of F(A)''' mapping into 5(A)<x- 
One might expect the rest of the proof to proceed as follows: 
1. Define an adelic measure on PGL2(0)\ '^ ''<^ ^^^ compute its volume. 



2. Show that ^ ( PGL2(Q)\^'^''<^ ) ^^ ^^^ approximated by that adelic volume. 

But statement 2 turns out to be false! It will be salvaged by excluding the quartic forms 
with a linear factor, i.e., those corresponding to the identity in a Selmer group. In other 
words, it is J2Ee<? ^^ Sel2(-E') — 1) that is approximated by the adelic volume. 

4.2. Computing the adelic volume 

The space Qp has the usual Haar measure /Xp (Lebesgue measure if p = oo). The adelic 
measure on A is the product of these. The product Ilfinite p/^p('^(^p)™"^) converges, so 
5(A)<x inherits an adelic measure from A^. In fact, iJ.^{S{R)<x) = 4-4"^/^27"^/^X^/'' 
(area of a rectangle) and /ip(iS(Qp)™"^) = 1 — j9~^p~^, and the product is /i(iS(A)<x) = 
cX5/6, where c := 2^/^3-^/Xi^Oy^- 

Although y(A)^'^ C A^ is a restricted direct product and not a direct product, it is 
a union of direct products that can be given an adelic measure as above. The action 

of PGL2(Q) is measure-preserving, so the quotient PGL2(Q)\^^ -^"^^ inherits the 
measure. 

Let S ^ S' he the universal elliptic curve in short Weierstrass form. Let W be the 
moduli space of pairs (/, P) where / is a quartic form of nonzero discriminant and P 
is a point on z'^ = f{x,y). The group scheme PGL2,5' acts on W (transforming both 
/ and P). The forgetful i5'-morphism F: W ^ V is an S-toTsoi, and is PGL2,5'- 
equivariant. In fact, W ^ S' is a homogeneous space under S x PGL25'. The stabilizer 

S' 

of {x^y + Axy^ + By'^ , (1:0: 0)) is S[2] embedded diagonally (see Remark l3.2p . so ly ~ 
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ISx PGL2,5' j / g[2]- The quotient 5'-morphism q: W ^ PGL2,5' \W ~ £/£[2] ~ £ 



is a PGL2-torsor, and it turns out to admit a rational section. 
We obtain a commutative (but not cartesian) diagram 



W^^£ 



V S'. 

Let W{A)^x be the subset of W (UpKoc^p) i^^^ of W{Ay.) mapping into V{Ay^j^. 

Let £{A)^x be the subset oi £ [Y[p<oo^p) mapping into S{A)^x- 

We define the measure of a subset of W{A)^x by integrating over V{Ay^^ the 
measure of the fibers of F, where each full fiber, an ii^^5(A)-torsor, is assigned the 
mass 1 Haar measure. Define a measure on £[A)^x in the same way by integrating 
over (S(A)<x- It turns out that the fibers of W{A)^x -^ £{A)^x outside a measure- 
zero subset are PGL2(A)-torsors, and that the Tamagawa measure /xxam on these torsors 
is compatible with the measures on W{A)^x and £{A)^x- 
Now consider 

PGL2fO)\^(A)<^^-^(A)<x 



PGL2(Q)\^(^)<^ -5(A)<x. 

Working counterclockwise from S{A)^x, we have 

/i(^(A)<x) = /i(5(A)<x) 
/^(PGL2(Q)\^(A)<^) =/iTam(PGL2(A)/pQL2^Q)) /i(^(A)<x) 

/" (PGL2(Q)\^(^)<^) = /" (PGL2(Q)\^^^)<^ 
and the Tamagawa number 

r(PGL2) := /ixam (PGL2(A)/pQL2i 
is known to be 2, so 

(13) /i (PGL2(Q)\^(A)<^) = 2 /i(5(A)<x) = 2cX'/'. 

4.3. Counting rational points in adelic regions 

Proposition 4.1 (Denominator for Average (# Sel2 -1)). — As X —)■ oo, 

J2 l = #5(Z)<x = (l + o(l))^(5(A)<x). 
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Proof. — The first equality is trivial. Now, ^S{Z)^x is the number of integral points 
{A, B) in a large rectangle that remain after sieving out those satisfying p'^\A and p^\B 
for some prime p and discarding those with AA^ + 27B^ = 0. The sieving is elementary, 
and can be handled either by a Mobius inversion argument |Bru92t Lemma 4.3], or by 
sieving at the first few primes with the Chinese remainder theorem and then arguing 
that the number of points removed by sieving at all the remaining large primes is 
negligible. This leaves (1 + o{l))cX^/^ points. The {A,B) with 4A^ + 27B^ = 
have the form {—3n'^,2n^); there are only 0{X^^^) such points of height up to X, so 
discarding them does not affect the asymptotics. (For related calculations over number 
fields, see |Bek04| .) D 

Let V(Q)<Y "^ be the set of / G V^(Q)<j5f that have no rational linear factor. Most 
of the rest of the section will be devoted to the proof of the following: 

Proposition 4.2 (Numerator for Average(#Sel2-l)). — As X -^ oo, 
J2 (#SeME) - 1) = # (pGL.(Q)\^W)<i^) 



Ee<ff<x 



(l + o(l))^(pGL2(Q)\^(A)<^)- 



Ideally, we could choose a fundamental domain J-" for the action of PGL2(Q) on 
V(Ay^-^ and simply count the rational points of ^(Q),!'^ ^ in it- In an attempt to 
construct such an J-" we use the theory of integral models of binary quartic forms. 

Lemma 4.3 (Existence of integral models |BSD63[ Lemmas 3, 4, and 5]) 

Any quartic form f G Q[x, y] with A G 2^Z and B G 2^Z is PGL2{Q)-equivalent to a 
quartic form in Z[x, y] . 

To avoid some inconsequential technicalities, we ignore the 2^ and 2^ in the rest of 
our exposition. We also ignore the points in ^(Q)<!'jic "^ with a nontrivial stabilizer in 
PGL2(Q): one can show that the contribution from these is negligible. 

Define Z := Ilfimtep^P' ^^^ define V^(Z)^'^ in the obvious way. The proof of 
Lemma [4.31 shows also that every quartic form in V^A^^t^ is PGL2(Q)-equivalent to 
one in V"(M)!5x x ^(Z)'" (if we ignore 2^ and 2^). 

An explicit fundamental domain J-'^ for PGL2(Z)\^ '^ can be obtained by com- 
bining Gauss's fundamental domain for PGL2(Z,)\ ^^ '^ with an easily described 

fundamental domain for PGL2(M)\^^ • By the previous paragraph, if J-'^x '■— 
{/ G J-'^ : H{f) < X}, then every quartic form in V{Ay^j^ is PGL2(Q)-equivalent to 
one in the subset J-" := T^x ^ ^(Z)^*^ (if we ignore 2^ and 2^). Rational points in J-" 
now are integral points in -F<x satisfying locally solvability. 

There are two problems with J-': 

1. The region T^x ^^^ ^ narrow cusp stretching to infinity, which makes it hard to 
approximate its number of integral points by its volume. 
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2. The set T is not a fundamental domain! (Although integral points in T^x cannot 
be PGL2(Z)-equivalent, they can still be PGL2(Q)-equivalent. This phenomenon 
can happen only for quartic forms whose discriminant is divisible by p^ for some 
prime p. For instance, 

24 "^ 22 S4 i4S 22 "^24 

'p X + px y + X y + xy + y and x + x y + x y + pxy + p y 
are PGL2(Q)-equivalent.) 

Problem 1 is solved by an idea from |Bha05[ §2.2], namely to average over a "compact 
continuum" of PGL2(M)-translates of J-'^x- This fattens the cusp enough that the 
volume estimate applies to the "main body" obtained by cutting off most of the cusp. It 
turns out that the severed part contains a disproportionately large number of integral 
points, but they are all from the quartic forms with a rational linear factor; on the 
other hand, the main body contains few quartic forms with a rational linear factor; this 
explains why we exclude them to obtain a count approximated by a volume. 

Problem 2 is more serious. One solution might be to find some way to select one 
PGL2(Z)-orbit of integral quartic forms within each PGL2(Q)-equivalence class. A 
more elegant solution is to select them all, but to weight each one by 1/n where n 
is the number of possibilities. By an argument involving the class number of PGL2 
being 1, this weight turns out to be expressible as a product over all primes p of local 
weights defined analogously in terms of the number of PGL2(Zp)-orbits of quartic form 
over Zp within a PGL2(Qp)-equivalence class. (Strictly speaking, one also needs to take 
into account the orders of stabilizers in defining these weights.) The situation is now 
similar to that in the proof of Proposition 14. H in which we counted integral points with 
a weight that was 1 or according to whether it was minimal (at every prime p) or 
not. If we approximate the actual weights by the product of the local weights at the 
first few primes, then the weighted count of integral points can be approximated by a 
weighted volume. It remains to show that the number of points at which the actual 
weight differs from the approximate weight is negligible. The local weight turns out to 
be 1 whenever p"^ \ A, so it suffices to sum the following bound over all primes p beyond 
a large number: 

Lemma 4.4 ( |BS10a) ). — The number of PGL2{1j)- orbits of integral quartic forms of 
height less than X such that A 7^ and p'^\A is 0{p"'^/^X^/^). 

The proof of Lemma l474l is the trickiest part of the whole argument. The observation 
that A is a polynomial in a, 6, c, d, e is enough to prove Lemma 14.41 for primes p up to 
a small fractional power of X, but it is not known for an arbitrary polynomial how 
to obtain suitable bounds on the number of values divisible by the square of a larger 
prime |Gra98tlPoo03| . Bhargava and Shankar resolve the difficulty in a surprising way: 



using |Woo09t Theorem 4.1.1], they identify the set of quartic forms with the set of 
quartic rings with monogenized cubic resolvent, which admits an (at most 12)-to-l map 
to the much larger set of quartic rings with cubic resolvent, which is in bijection with 
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(GL2(Z) X SL3(Z))-orbits of pairs of ternary quadratic forms |Bha04t Theorem 1]. Then 
they do the counting in this larger set, whose size was calculated in |Bha05t Theorem 7]. 
This concludes the sketch of the proof of Proposition 14.21 

Remark 4-5. — The role of Lemma S3] is to ensure that we are not overcounting orbits. 
Without Lemma [4. 4[ we could still deduce Average (# Sel2) < 3. 

Remark 4-6. — Calculations related to Lemma 14.41 are used in |BS10a] to compute 
not only Average(# Sel2), but also other averages, such as the average size of the 2- 
torsion subgroup of the class group of a maximal cubic order equipped with an element 
generating it as a ring. 

4.4. End of proof 

Dividing Proposition 14. 2l bv Proposition 14. II and using the volume relation (IT3|) yields 

Average (# Sela -1) = r(PGL2) = 2. 
Addl. 
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